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Abstract. We explore the geometric notion of prolongations in the setting of computational 
algebra, extending results of Landsberg and Manivel which relate prolongations to equations for 
secant varieties. We also develop methods for computing prolongations which are combinatorial 
in nature. As an application, we use prolongations to derive a new family of secant equations 
for the binary symmetric model in phylogenetics. 



1. Introduction 

The notion of prolongation originated with Cartan in the context of differential geometry [JJ 
[T0| I12j. We give the simplest formulation of the definition below. Since we are using differential 
operators, we will assume that our ground field K has characteristic zero. 

Definition 1.1. Let S = . . . , x n ], and let A C S d be a vector space of polynomial forms 

of degree d. The r-th prolongation of A, denoted by A^ r \ is 

If € S d+r | G A for all (3 e N n with \/3\ = r\ . 

In this paper our interest lies not in the role of prolongation in differential geometry, but 
instead in exploring the applications of purely algebraic reformulations of the definition to three 
areas: algebraic geometry, commutative algebra, and phylogenetics. In particular, we will ex- 
plain and generalize results of Landsberg and Manivel [13] connecting prolongations and secant 
varieties, as well as apply these ideas to the computation of some nontrivial secant equations 
arising in phylogenetics. 

Recall that if X C IP" -1 is a projective variety, Sec r (X) = is the Zariski closure of the 
union of all r — 1 planes spanned by r points in X. Let / = I(X) and suppose that the smallest 
degree of a minimal generator of / is d. If A = I^, the significance of the prolongation A^ 
comes from connections to secant ideals of I. 

Theorem 1.2. Let X C p n_1 be a variety over an algebraically closed field of characteristic 0, 
with I = I(X) C (xi, . . . ,x n ) d , and A = I d . Then A^" 1 ^- 1 ^ is the degree r(d - 1) + 1 piece 
of the ideal of the secant variety X^ . 

Theorem 11.21 generalizes Lemma 2.2 of |13j which concerned prolongations of spaces of qua- 
dratic forms. In the subsequent paper [13], Landsberg and Manivel allude to a generalization 
of their lemma for higher degrees, but never give a precise statement. At the heart of the proof 
of Theorem 11.21 are connections relating prolongations to polarizations of homogeneous forms 
and to symbolic powers of ideals. Relationships between symbolic powers and secant ideals 
are not new; for example containment of secant ideals in symbolic powers appears explicitly in 
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Proposition 2.1 of Catalano-Johnson in [6] and Corollary 4.6 of Vermeire in |20| shows that if 
X is smooth, a graded piece of a high enough symbolic power cuts out its secant line variety 
set-theoretically. 

One reason that Theorem 11.21 is useful is that it allows for the straightforward computation 
of equations that belong to the secant ideals I(X^) using linear algebra. Note that by [15], 
r(d — 1) + 1 is the lowest degree in which there can exist a nonzero polynomial in I(X^). Even 
if I is generated by A = I a, it need not be the case that yl((rf-i) ('-!)) generates I(X^). In 
spite of this, in several instances of practical interest, the prolongation provides many nontrivial 
equations in secant ideals that are difficult to derive directly from the definition of the secant 
variety. Thus, prolongation at least brings us one graded piece closer to understanding secant 
ideals. 

We conclude the paper by returning to our original motivation which was the relationship to 
algebraic statistics, where algebraic varieties are interpreted as statistical models. Passing to the 
secant variety in algebraic geometry amounts to taking a mixture model in statistics. We will use 
prolongations as a tool for describing non-trivial secant equations for the binary symmetric model 
in phylogenetics, which has received attention recently in the algebraic geometry community 

mm- 

This paper is organized as follows. We describe several equivalent defintions of prolongation 
in Section 2, ending with a proof of the relationship between prolongations and symbolic powers. 
In Section El we describe algorithms for computing prolongations. Theorem II .2\ which connects 
prolongations and secant equations will follow from results in Section [H Part of our proof follows 
along the lines of ideas from [13] and [H], which we attempt to make more explicit, and part 
of the proof depends on a new application of the join of two ideals. We derive some nontrivial 
secant equations for the binary symmetric models in Section [5j 
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Notations and Conventions 

Since we are working with differential operators, unless explicitly stated, we assume that K 
is a field of characteristic zero. In situations where we need K to be algebraically closed, we 
explicitly state this. 

All varieties X are projective and reduced, but they need not be irreducible; that is, we 
assume that the ideal I{X) is homogeneous and radical, but not necessarily prime. 

Let N denote the non-negative integers. If a = (a±, . . . ,a n ) € N n , then a\ = ail ■ ■ ■ a n \. If 
\a\ = d, we let (^) denote the multinomial coefficient with parts a±, . . . ,a n . We write 1 to denote 
a vector in which each coordinate is 1. A monomial in the polynomial ring K[a;i, . . . , x n ] = K[x] 
is given by an element a £ N n where x a = x" 1 ■ ■ ■ x" n . 
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2. PROLONGATIONS: EQUIVALENT DEFINITIONS 



In this section we focus on making the translations between purely algebraic descriptions of 
prolongation and Definition 1 1 . 1 1 explicit . The importance of the algebraic definitions is that they 
allow us to connect the notion of prolongations to commutative algebra and algebraic geometry, 
specifically, to symbolic powers and equations defining secant varieties. 

In the interest of keeping our introduction to prolongations self-contained, we briefly review 
the definition of the symmetric algebra in terms of the tensor algebra as well as the basics of 
polarization although this material will be well-known to some. We define prolongations from 
the quotient algebra point of view in Section 12.11 We translate the definition into the language 
of polarization in Section 12.21 Finally, we illustrate how thinking of prolongation in terms of 
polarization leads to connections with symbolic powers. 

2.1. Prolongations and the symmetric algebra. In this section we will recast the definition 
of prolongation in terms of the symmetric algebra, viewed as a quotient of the tensor algebra. 

Let V be a finite dimensional vector space over a field IK, and V* denote its dual. We follow 
the conventions of Appendix B of [9] . The reader may also want to consult Appendix A2 of [8] 
or Chapter 1 of [21]. 

We let T = © d>0 T d V* be the tensor algebra of V*, where T d V* denotes the tensor product 
of V* with itself d times. The symmetric algebra S, on V* is defined to be the quotient of T by 
the ideal {x ®y — y x \ x,y £ V*). If we pick a basis x = (x±, . . . , x n ), for V* , then we may 
identify S with lK[a?i, . . . , x n ], the homogeneous coordinate ring of FV. A monomial x a of degree 
d represents the equivalence class of tensors which map to it under the canonical projection 
T -> S. 

The co-multiplication or diagonal map 

gd+ry* _^ gdy* g, gry* ^ bg important in wria t 

follows. First, we describe it using the intrinsic point of view of Appendix A2.4 in [8]. Recall 
that the diagonal map A : S — > S ® S sends i6 7*hj01 + 18i. We get a map A.^ r '■ 
gd+ry* _j, gdy* gry* restricting the diagonal map to S d+r V* and composing this with 
the projection to S d V* ® S r V* . For example, since 



we see that the co-multiplication map A2,i sends x 2 y to 2xy ®i + i 2 8i/6 S 2 V* ® S 1 V*. 

Following pg. 5 of [2TJ we can also describe the co-multiplication map by its action on mono- 
mials in terms of our basis x. If i\ < • • • < irf+ r , then 



where we sum over all permutations a of d + r elements such that <r(l) < • • • < cr(d) and 
a(d + 1) < • • • < a(d + r). 

The reason that co-multiplication appears in connection with prolongation is that it is closely 
related to partial differentiation. 



A(x 2 y) = (x ® 1 + 1 ® x) 2 (y ® 1 + 1 ® y) 

9 9 9 9 

= x y ® 1 + 2xy ®x + y®x + x ®7/ + 2x®xy-(-l®a;?/, 




Lemma 2.1. If F € S d+r V*, then A d , r (F) = 

\/3\=r 



1 d r F 
/3!dx^ 



® x' 
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Proof. By linearity it suffices to assume that F is a monomial F = x a . The projection of A(x a ) 
to S d V* (g) S r V* is the sum of all of the terms of A(x a ) which can be written in the form — tg)x^ 
with 1/31 = r. Since 



A(x a ) = Y[(xi ® 1 + 1 <8> Xi 



i=i 



there will be (?) • • • (^ n ) terms in the product of the form — <S> x^ 3 , all equal to x a & ® x^. But 



□ 



We can use co-multiplication to see that the following algebraic definition of prolongation 
given in Section 2.1.3 of [13] is equivalent to Definition II .![ 

Lemma 2.2. If A C S d V*, then = (A S r V*) n S d+r V*. 

Proof. Note that A^ r maps F € S d+r V* to an element of the form 

Fp®^ e S d V*®S r V*. 

\P\=r 

This is in A ® S"*!/* if and only if each i 7 ^ G A, and by the previous lemma, Fp = ^ 

2.2. Prolongations and polarization. In this section we explain the connection between 
prolongation and polarization. Polarization, which arose in classical invariant theory [22], is the 
higher degree analog of associating a symmetric bilinear form to a quadratic form and is closely 
related to the representation of a homogeneous form as an element of the tensor algebra. The 
notion of polarization is also used in §1 of Chapter VI in [2 J in connection with secant varieties 
of curves. What is significant for us is that thinking of prolongation in terms of polarization 
opens the door to connections with symbolic powers and with secant varieties. 

Definition 2.3. Suppose that F is a homogeneous polynomial of degree al in K[x] where x = 
(xi, . . . , x n ). For each i = 1, . . . , d we introduce a new set of n variables x^ — (x^i, • • • , ^m)- 
We also introduce an auxiliary set of variables t = (t±, . . . , to). The polarization of F, denoted 
F(xi, . . . , Xrf), is the coefficient of t 1 in the expansion of F(tiXi + • • • + td x d) as a polynomial in 
t. 

Example 2.4. Let -F(x) = xfx2- We compute 

F(£iXi + i 2 x 2 + t 3 x 3 ) = (hx n + t 2 X 2 l + t 3 X 31 ) 2 (tiXi2 + t 2 X 2 2 + t 3 x 32 ) 
= tfx^Xu + 4x^X22 H 

+ 3*1*2(^11^22 + ^11^21^12) H 

+ ht2t 3 (2xnX2ix 3 2 + 2x11x31x22 + 2x21x31x12). 

We see that F(xi,x 2 ,x 3 ) = 2x11X21X32 + 2xiix 3 ix 2 2 + 2x 2 ix 3 ixi2. 

Lemma 2.5. If F(x) is a homogeneous form of degree d, then 
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(1) 

F(t lXl + • • • + t d x d ) = ^F(xf ,. . . ,xf) 

|/3|=d 

where x^ 1 means that the set of variables Xj is repeated 0i times. 

(2) F(xi, . . . , Xrf) zs linear in each set of variables Xj. 

(3) F(x\, . . . ,x d ) is symmetric in the Xi . (If a is a permutation of al elements, then F(xi, . . . ,x d ) = 
F(x (7 ( 1 ), . . . ,x a ^).) 

(4) F(x,...,x) = d!F(x) 

Proof. Note that it is enough to prove the stated claims in the case where F(x) = x a . In this 
case 

n 

(1) F(t a xi + • • • + t d x d ) = Y[{t lXlj + ■■■ + t dXdj ) a i . 

j=l 

For part (1), recall that F(xi, . . . ,x d ) is the coefficient of t 1 in the product consisting of d 
factors of the form {t\X\j + ■ ■ - + t d x d j) as above. From this definition, we see that the coefficient 
of t 1 is a sum of dl monomials (counted with multiplicity) which correspond to the d\ ways of 
choosing one term per factor, where each tj is chosen exactly once. 

We can construct the dl monomials which are coefficients of t 1 as follows. Let the d factors 
in Equation ([1]) be F±, . . . ,F d . Fix a subset / C [d] of size 0. A monomial coefficient of t 1 is 
gotten by choosing terms of the form t^x^j from among the factors F s with s£ / where k G / 
and the k are all distinct, and d — (3 terms t^xij from among the factors Ft with t ^ I where 
i $l I and the £ are all distinct. Since there are (^) ways to choose the set /, [3\ ways to make 
our construction of a monomial from the factors F s with s € I and (d — 0)1 ways to construct a 
monomial from the factors Ft with t £ I, we see that we get each monomial coefficient of t 1 in 
this way. 

Let us consider F(x^ x , . . . ,x^ d ) where Xj is repeated /3, times. We can pass from F(xi, . . . ,x^) 
to F(x^ , . . . , x^ d ) by computing the result of repeating the i-th set of variables 0i times succes- 
sively for each i. 

Assume that / C [d] is of size and that for each k E I, t^x^j is replaced by tjXjj. In 
F(xi, . . . ,x d ) there are ways of choosing terms of the form tkXkj from the factors F s with 
s G I with k E I and the tf. all distinct, but if we replace each t^x^j by tjXjj, all of these fyl 
choices look the same. 

Now let us turn to the consideration of the coefficent of t@ in Equation (|TJ) . Note that there 
is only one way of choosing /3j terms of the form tiXij from among the factors F^ with k E I. 
Considering each of the d sets of variables in turn, we see that if we substitute the ith set of 
variables 0i times in F(xi, . . . , x^) we will see each monomial that appears as a coefficient of iP 
repeated (31 additional times. Therefore, the coefficient of t@ is ^,F(x^ 1 , . . . ,x^ d ). 

Parts (2) and (3) follow immediately from the definition of F(xi, . . . ,x^) as the coefficient of 
t 1 in Equation ([1]). Part (4) follows by computing F(x, . . . ,x) as the coefficient of t 1 in 

n 

F(t x x + ■■■ + t d x) = + • • • + t d ) Xi ) a * = (*! + ... + t d ) d x a . 

i=l 
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□ 

Observation. Recall that we may identify the elements of S d V* with elements of T d V* that are 
invariant under the action of the symmetric group on d letters. This point of view is especially 
important in [13| and [14j . Explicitly, if € V* , then 

vi ■ ■ ■ Vd >-> v a{1) ® • • • ® v a{d) . 

Note that the image of a monomial m € S d V* in T d V* is a weighted sum over all of the 
monomials in the coset represented by m, and m is \/d\ times the projection of this element of 
T d V* into S d V* . Therefore, we can easily pass from the expression of an element of S d V* as a 
d-tensor that is invariant under the action of the symmetric group on d letters to its polarization; 
we just write the elements appearing in the i-th factor in the tensor in terms of and erase the 
tensor symbols. For example, 

x\%2 i-» 2{x\ g) x\ g) x 2 + x\ <g> x 2 ® x\ + x 2 (8> xi (g) x\) i-> 2(xnx 2 ia;3 2 + X11X22X31 + xi 2 x 2 i3;3i)- 

The following elementary lemma describes relationships between polarization and partial dif- 
ferentiation. 

Lemma 2.6. Let F be a homogeneous polynomial of degree d + r. 

(1) IfF = x a , then 

PeM n ,\(3\=r V ^ 7 

(2) If (3 e N n with \(3\ = r, then 

8 r F ^F(x,...,x,y,...,y) 



d\r\- 



In both expressions we assume there are d copies o/x and r copies ofy. 

Proof. (1) When we polarize F, we get a [d + r)-linear symmetric form in d + r sets of variables 
Xj. Each of the ( d ^~) distinct monomials in F(xi, . . . , x^ +r ) appears with coefficient a\. For 
any choice of (3, there will be (o) distinct monomials which will agree (and have y@ as a 

factor) when the first d sets of variables are all set to x and the last r are set to y. 

(2) It is enough to prove the result for an arbitrary monomial of degree d + r. Assume, without 
loss of generality, that F = x°. Using (1) we see that the coefficient of y@ in F(x, . . . , x, y, . . . , y) 
is 

\cc-Pj\p) (*-P)\l3\ 
Therefore, we see that taking partial derivatives with respect to y@ yields 

d'.rl-. — x p = dlrl——;. 

(a- (3)\ dxP 
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□ 

The next lemma is a modification of an observation in [13J. (See also the discussion after 
Corollary 3.2 in [H].) 

Lemma 2.7. Let A C S d V* , and F G S d+r V* be a homogeneous polynomial with polarization 
F(x, . . . , x, y, . . . , y), of degree d in the x-variables. The following are equivalent: 

(1) F is in A^. 

(2) Every coefficient ofF as a polynomial in the y -variables is in A. 

(3) Every coefficient o/F(x, . . . , x, yi, . . . , y r ), viewed as a polynomial in all of the y -variables, 
is in A. 

(4) F(x, . . . , x, v, . . . , v) G A for every choice of v G V. 

Proof. First we show the equivalence of (1) and (2). We know that F G A^ if and only if 
S£ G A for every /3 G N n with \(3\ = r. But is just /3! times the coefficient of y@ in F by 
part (2) of Lemma 12,61 

The equivalence of (2) and (3) follows because the coefficient of the monomial y^ in the 
polynomial F(x, . . . , x, y, . . . , y), which has degree r in the y- variables, is (^) times the coefficient 
of some monomial in the r sets of y- variables in F(x, . . . , x, yi, . . . , y r ). 

For the equivalence of (2) and (4), note that we are working over an infinite field. If we 
choose { d+r d ^ n r ' 1 ) sufficiently generic points Vj, the vectors in indeterminates F a of the form 
^| Q ,| =r i ? aV" will be linearly independent. Hence, they are all in A, if and only if every F a G 
A. □ 

Lemma 12.71 allows us to prove the following result about the prolongations and ideals. 

Theorem 2.8. Let A be a subspace of S d V* and let I C K[x] be the ideal generated by A. Then 
F G S d+r V* is in A^ if and only i/geJ for every (3 G N n with \(3\ = k < r. 

Proof. One inclusion follows immediately from Definition ll.il For the opposite inclusion assume 
that F G A( r \ Write F(x, . . . ,x,yi, . . . ,y r ) as a polynomial in the variables y^ : 

F /J (x,...,x)y*. 

/3eN™ r ,|/3|=ft 

The symbol y above stands for the vector of vectors y = (yi, . . . ,y r ). 

Part (3) of Lemma l2.7l tells us that each -Fg(x, . . . ,x) is in A. Now set the variables yi, . . . , y r -k 
equal to x. Each 

F p (x, x)y^ ^ F /3 (x, . . . , x)x Q 'y^ a 
where a' G N n has i-th coordinate Ylj a ij- From this, we can see that the coefficients of the 
monomials in the remaining yj-variables are all in /. Setting the remaining y^-variables all equal 
to v, we see that the coefficients of the monomials in v will be in I and may be interpreted as 
partial derivatives of F of order k (up to a scalar) by part (2) of Lemma 12.61 We conclude that 
all partial derivatives of order k are in / for all k < r. □ 

Recall that if I C S is a radical ideal with Ass(J) = {Pi, . . . ,P m }i then the r-th symbolic 
power of I is defined to be 

jW --i^w-^ns, 
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where W = S\{P\ U • • • P m ). We define the r-th differential power of I to be 

I <r> = // G S | G / for all < r - lj . 

If K is algebraically closed of characteristic and I is prime, then by the theorem of Zariski 
and Nagata 1^ = I <r> . See Theorem 3.14 in [8] for a discussion of the proof and pointers to a 
more general statement in characteristic p. 

The theorem of Zariski and Nagata also holds for radical ideals. While we found this statement 
in the literature, we could not find its proof, so we include one for completeness. 

Corollary 2.9. If I is a radical ideal over an algebraically closed field of characteristic zero, 
then = I <r> . 

Proof. Suppose that Ass(I) = {Pi, . . . , P m }. It is easy to see that I <r> = nP i <r> , so by the 
theorem of Zariski and Nagata it suffices to show that 1^ = H-P^ . Since I is radical, by prime 
avoidance, Ass(/( r * ) ) = Ass(I), and the Pj-primary component of 1^ is (I^)p i n S. But, as 
localization commutes with products and intersections, we have 

(j(r)) Pj = (nw- 1 ] n s) Pi = {P) Pi n s Pi = (i P J = ((Pi) Pi )r = (if ) P , 

We see that (7 (r) ) p D S = (P[) Pi n S = P. (r) , which completes the proof. □ 

Thus, we have the following corollary to Theorem 12.81 

Corollary 2.10. Let K be an algebraically closed field of characteristic 0. Let I be a radical 
ideal with I C (x±, ■ ■ ■ ,x n ) d and let A = 1^. Then the (d + r)-th graded piece of /( r+1 ) is A^'\ 



3. Computing Prolongations 

In this section we describe algorithms for computing prolongations that use linear algebra and 
can be implemented in a computer algebra system. We also discuss how combinatorial tools can 
be used to speed up the computations by reducing the dimensions of the intermediate vector 
spaces that need to be computed. These combinatorial approaches can also be used to determine 
explicit descriptions of prolongations. 

3.1. Algorithms. We will describe some algorithms for computing prolongations which depend 
on various implementations of the equivalent definitions from Section [2j In practice, we will have 
a basis for A consisting of a set of homogeneous polynomials of degree d and will want to compute 
A^ r \ The crucial step in each of the algorithms for computing prolongations that we describe 
may be performed by Gaussian elimination. However, what is easiest to implement depends on 
the way polynomials are stored since converting polynomials to vectors which may be operated 
on by the user may be nontrivial in practice in any given computer algebra package. 

Algorithm 3.1. 

INPUT: A basis for A. 

OUTPUT: A basis for A^ C S d+r V*. 

STEP 1: Map a basis for S d+r V* into S d V* ® S r V* via the co- multiplication map. 
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STEP 2: Compute the intersection of A(£> S r V*, with the space constructed in step 1. Multipli- 
cation (just "erasing" the tensor symbol) maps a basis for this intersection into S d+r V* giving 
a basis for . 

Unfortunately, the simplest implementation of Algorithm 13.11 introduces a new set of variables 
to represent the terms to the right of the tensor symbol, which slows computation. Alternatively, 
we can exploit the connection between co-multiplication and partial differentiation to avoid 
introducing a new set of variables. 
Algorithm 3.2. 
INPUT: A basis for A. 
OUTPUT: A basis for A^ . 

STEP 1: For each /3 € W, with |/3| = r, compute Ap, the space of all forms of degree d + r 

whose partial derivative with respect to is in A. 

STEP 2: The intersection of the spaces Ap is equal to A^'\ 

Another alternative would require more extensive programming, but is potentially quite fast. 
The complexity of Algorithm 13.31 is governed by the amount of pre-processing necessary to 
coordinatize our basis vectors and the Gaussian elimination in STEP 3. 
Algorithm 3.3. 
INPUT: A basis B for A. 
OUTPUT: A basis for A^ . 

STEP 1: Fix a term order so that monomials form an ordered basis for S. Abusing notation, 
let A denote the matrix whose columns are the coefficients of the elements of B with respect to 
this ordered basis. 

STEP 2: Let F be the generic form of degree d + r. Let C be a matrix with a column for each 
|/3| = r. The column corresponding to (3 is the coordinate vector of with respect to our 
ordered basis. (After a suitable scaling of the basis elements, C is the Catalecticant matrix 
C(d,r;n).) 

STEP 3: Form the augmented matrix [A|C]. The space A^ is just the space of polynomials F 
for which the augmented matrix [A\C] is consistent. We find this space by putting A in reduced- 
echelon form which will give a linear equation on the entries of C for every zero row to the left of 
the bar in the augmented matrix. Solving this system of equations gives the coordinate vectors 
of a basis for A^ . 

3.2. Monomial Prolongations. In this section, we describe the prolongations of vector spaces 
spanned by monomials. The monomial case can be solved purely combinatorially and can be 
used as a tool for reducing the computational burden in the general case. 

Proposition 3.4. Suppose that A is spanned by monomials. A monomial x a is in A^ if and 
only i/x""^ £ A for all x^ 3 dividing x" with \(3\ = r. 

Proof. The differential operator maps monomials to monomials. If ~x? divides x Q then 
T^g-x a = Cx" - ^ for a nonzero constant C, otherwise ^x Q = 0. □ 

An important special case arises when d = 2. In this case, the generators of A have two types: 
squarefree pairs XiXj and pure powers xf. Let a C [n] denote the set of i such that x\ S A. We 
define a graph G(A,r) as follows: 
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Definition 3.5. Let A be a vector space spanned by quadratic monomials from K[x]. For each 
integer r > 0, we define a graph G(A, r) with r + 2 vertices for each indeterminate whose square 
is in A, and a single vertex for all other indeterminates. Formally, the vertex set of G(A, r) is 
the set of all pairs with i 6 [n], where j G [r + 2] if i G a and j = 1 otherwise. A pair of 
vertices (ii,Ji) (^2^2) is connected by an edge if x^x^ G A. 

The graph G(A, r) can be used to read off the generators of the prolongations A^ . 

Corollary 3.6. The induced subgraph of G(A,r) on vertices ■ ■ ■ , (v+2> jr+2) i> s a com- 

plete graph if and only if x% x ■ ■ ■ Xi r+2 is in the prolongation A^ . 

Proof. The set of vertices • • • , (v+2> jr+2) forms a K r+ 2 if and only if Xi k Xi l G A for all 

l</c<Z<r + 2if and only if each divisor of Xi x ■ ■ ■ Xi r+2 of degree r has quotient in A. □ 

Example 3.7. Let A be the span of x\ , X1X2, X2^3, ^2^4, ^3^4 £ K[a?].,a?2,a?3,a?4]. To compute 
A^ 1 ), we construct the graph G(A, 1): 

(3,1) 
I 



(4,1) 

The graph G(A, 1) contains five complete subgraphs K3. One is the subgraph of G(A, 1) on 
the vertices (1, 1), (1, 2), (1, 3) and this corresponds to the monomial x\ £ A^\ There are three 
triangles in the graph of the form (l,i), (l,j), (2, 1) and these all correspond to the monomial 
x\x2 G A^\ Finally the triangle (2, 1), (3, 1), (4, 1) corresponds to the monomial X2X3X4 G 
Corollary 13.61 implies that these three monomials span 

Corollary 3.8. Computing prolongations is NP-hard. 

Proof. Focusing on the case where M is generated by squarefree quadratic monomials, we see 
that the prolongations are determined by the complete subgraphs in a fixed graph G(M). In 
particular, A^ is empty if and only if the largest clique of G(M) has cardinality less than r + 2. 
However, determining the cardinality of the largest clique is NP-hard. □ 

Besides the connections to graph theory, the monomial case can be used as a tool for reducing 
the dimensionality of the computations described in Section [2j 

Proposition 3.9. Let A C S d V* be any vector space of forms of degree d, and let M(A) denote 
the span of the monomials that appear as a term with nonzero coefficient in some polynomial in 
A. Then A^ C M{A) (r l 

Proof. A monomial differential operator is injective on the set of monomials it does not kill. 
Thus, for every monomial x a of every polynomial in A^ r \ and every divisor x^ of x a there 
exists a polynomial / G A such that x" - ' 3 appears with a nonzero coefficient in /. □ 
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Proposition 13.91 can be useful for computations because the monomial case can be precom- 
puted combinatorially. Then when applying the algorithms from the previous sections, one can 
immediately eliminate any polynomials that arise in a partial computation that do not belong 
to M(A)( r \ Furthermore, the monomial case can be used as a theoretical tool to prove that 
certain prolongations are, in fact, empty. 

Example 3.10 (No 3-way Interaction). Recall that the toric ideal of the no 3- way interaction 
model is the kernel of the ring homomorphism 

4>imn ■ ^[xijk I i G e [m},k 6 [n]] -> K[aij,b ik ,c jk ], x ijk i-> aijb ik c jk . 

The no 3-way interaction model is an example of a log-linear model in statistics. Giving a 
complete description of the toric ideals I\ mn = ker <f>i mn is a challenging open problem in algebraic 
statistics that has been studied by many authors [HE]. It is known that the lowest degree of 
a minimal generator is 4 and that A = {Ii m n)i is spanned by the ( 2 ) (™) Q) binomials that are 
equivalent to 

^111^122^2123:221 ~ £112^121 ^211^222 

under the natural action of the product of symmetric groups Si x S m x S n on indices. 

Let M{A) be the space of quartics spanned by the monomials appearing in these binomials. 
We will show that M(A)^ = for all k. Proposition 13.91 then implies that A^ = for all k. 
Since the prolongation of a prolongation is a prolongation, it suffices to show that M(A)^ = 0. 
This, in turn, is equivalent to showing that there is no monomial of degree five in K[x] which 
is divisible by five distinct monomials from M{A). However, if we are given any three variables 
that are part of a monomial in M(A), there is a unique way to complete it to a monomial in 
M(A), which guarantees that no degree five monomials of the desired type could exist. 

Applying Theorem 11.21 we have shown that ^4.( 3 ( r-1 )) = 0, and hence that the degree 3r + 1 
piece of I t is zero for all r, I, m and n. This implies that these secant ideals cannot be generated 
in their lowest possible degree. □ 

Another useful property of the monomial point of view is that generation by circuits is pre- 
served when taking prolongations. 

Definition 3.11. Let A C S d V* be a vector space of polynomials and / £ A. The support of / 
is the set of monomials that appear with nonzero coefficient. The polynomial / is a circuit of 
A if any g E A with supp(<7) C supp(/) is a scalar multiple of /, in other words, / has minimal 
monomial support. We say that A is minimally generated by its circuits if the set of all of its 
circuits is a basis for A. 

Remark. Circuits are basic objects in matroid theory that generalize linearly dependent sets. 
Note that for polynomials there are two natural definitions of circuits. One is the definition that 
we have used, where we consider the set of polynomials as a vector space, and the connection 
to linear algebra is clear. Another definition of circuit for an ideal I is that a circuit of / is a 
polynomial such that the set of variables appearing in / is minimal with respect to inclusion 
among all nonzero polynomials in /. If / is a prime ideal, this leads to the notion of an algebraic 
matroid. This is the definition of circuit which appears, for instance, in Chapter 4 of [TBJ, but 
this is not the notion of circuit that we mean. 
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Note that A C S d V* is minimally generated by its circuits if and only if the monomial 
support of any two circuits with distinct support are disjoint. Indeed, suppose that / and g are 
two circuits that contain the monomial x a with coefficient one. Then the polynomial / — g G A 
does not contain x a , The new polynomial can be written as a linear combination of circuits with 
support in supp(/ — g). But this implies that either / or g is not needed as a minimal generator. 
We show below that the property of being minimally generated by circuits is preserved under 
prolongation. 

Proposition 3.12. If A is minimally generated by its circuits, then so are the prolongations 
AM. 

Proof. It suffices to show that if x" is a monomial that is in the support of some circuit of AM ; 
there is no other circuit of AM containing x a in its support. Suppose to the contrary that there 
are two circuits / and g that contain a monomial in common. Let S be the set of monomials 
appearing in both / and g. Let x a be any monomial in S, and let be any divisor of x" of 
degree r. The derivatives and T^f- are thus nonzero and in A. Moreover, they must both 
contain a multiple of the same circuit h that contains x a "^, because A is minimally generated 
by its circuits. This means that if x 7 appears in h, then x^ 3+ ~ ) ' appears in both / and g, and 
will belong to S. 

Now let fs and gs be the polynomials obtained from / and g by summing those terms 
corresponding to elements of S. We will argue that if x' 3 has degree r, then ^q£§- is in A. Indeed, 
by the argument in the preceding paragraph, if x^ 3 divides an element of S, we get an element 
of A, and otherwise = 0. However, since / and g were circuits, this implies that fs = f 
and gs = g- Since / and g are circuits with the same support, they must be nonzero multiples 
of each other. □ 

Proposition 13.121 is useful in special cases for proving that we have determined a complete 
generating set for a particular prolongation. 

Example 3.13. Let I be the ideal of the Segre embedding of P ni_1 x P" 2 " 1 into P n xn 2 -i_ The 
ideal / is generated by the 2x2 minors of the generic matrix X = [x%j). Let A be the space 
of quadrics spanned by these 2x2 minors. Note that each 2x2 minor is a circuit, and these 
circuits have disjoint monomial support. By Proposition 13. 121 AM i s generated by circuits which 
have disjoint monomial support. 

The monomials appearing in the 2x2 minors generating I are precisely the monomials 
x hji x i2j2 such that i\ ^ %i and j\ / ji. The cliques in the resulting graph G(A,r) are the 
monomials of the form Xi 1 j 1 ■ ■ ■ Xi r+2 j r+2 such that ik / %i and jk ^ ji for all k ^ I. Each such 
monomial is a term of a unique (r + 2) x (r + 2) minor. Each (r + 2) x (r + 2) minor belongs to 
AM which can be verified by differentiating the Laplace expansion of the determinant. As each 
minor is a circuit, we deduce that these (r + 2) x (r + 2) minors span the prolongation. □ 

4. Prolongations and Secant Varieties 

In this section we will explain the relationship between prolongation and secant varieties. The 
proof of Lemma 2.2 in [13] goes through in a more general setting, and our proof of Theorem 
14.11 follows along these lines. Although we could also use the ideas of the proof of Lemma 2.2 in 
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|13j to prove Theorem 14.2^ we give an alternate and simpler proof appealing to the computation 
of joins of ideals. 

Theorem 4.1. Suppose that X C p n_1 is a variety over an algebraically closed field, and 
I = I(X). Let A = Id . Then ^4(( r ~ 1 )( d_1 )) is contained in the ideal of the r-th secant variety of 
X. 

Proof. Suppose that F E A(( r_1 K rf-1 )) so that 

degF = (d - l)(r -l)+d = dr-r-d+l + d = dr-r + l = r(d-l) + l. 

A general point on the r-th secant variety of X is the span of r points of X. So, let v = 
^i v i + ■ • - + t r v r where the ti and Vj are indeterminates. We will show that for any specialization 
of the Vj to points of X, and t{ E K, F(v) = 0. Since 

FW= (^-l) + D! F(X '---' X) ' 
F(v) = if and only if F(v, . . . , v) = 0. 

The point now is that the polarization F(xi, . . . jXt-m— i)+i) is linear in each set of variables 
Xj. This implies that 



r 



F(xi, . . . ,Xj_i, V,X i+ i . . . ,X r (d-l)+l) — 2^.*j F ( x l> ' ' ' ' x «-l> v j; x «+l) • • • > x r(d-l)+l)- 

Therefore, we see that 

F(v,...,v)= ■£ (* d -? + 1 )***.....*). 

f3eN r ,\/3\=r(d-l)+l ^ ' 

where Vj is repeated $ times. For each (3 in the sum, \(3\ = r(d— 1) + 1, implies that some Pi > d. 
Therefore, F(v^ 1 , . . . , v(? r ) can be written as a polynomial whose coefficients have degree d in 
Vj. Since F E 

A ((r-l)(d-l)) 

, every coefficient of a monomial in the y- variables in the polynomial 
F(x, . . . , x, yi, . . . , y( r -i)(d-l)) is m A by part (3) of Lemma 12. 71 Therefore, each of these degree 
d coefficients is in A (written in the Vj-variables). Thus, if we specialize all of the Vj to points 
of X, F(vf , . . . , vf r ) = 0. We conclude that F(v, . . . , v) = 0. □ 

We also have the partial converse if we know that the ideal of X does not contain any forms 
of degree < d. 

Theorem 4.2. Suppose that X C P n_1 is a variety over an algebraically closed field and that 
no form of degree < d — 1 vanishes on X. If m = r(d — 1) + 1, then I(X^) m = ^4(( r_1 )( (i_1 )) . 

To prove the theorem, we collect some general definitions and results about secants and joins 
of ideals. Given a collection of ideals I±,...,I r C K[x], their join is obtained by constructing 
the new ideal 



I x * • • • * I r = Ji(yi) H h J r (y r ) + ( Xj - yij | j E [n 



i=l 
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where yj denotes the vector of variables Vj = (yn, ■ ■ ■ ,yi n ), h(yi) denotes the ideal Ij with 
variable yij substituted for variable Xj, and the big ideal in parentheses is contained in the 
ring K[x,yi, . . . , y r ]. The r-fold join of I with itself is the r-th secant ideal 1^ = I * ■ ■ ■ * I. 
If Ii,...,I r are saturated homogeneous radical ideals over an algebraically closed field, then 
Ii* ••• * I r is the homogenous ideal representing the embedded join of the projective varieties 
V(Ii), . . . , V(I r ). The secant ideal 1^ is the vanishing ideal of the r-th secant variety of V(I). 
Note that the join construction is commutative and associative, and it respects containments. 
Proposition 14.31 due to Simis and Ulrich, puts a restriction on the degrees of forms which may 
appear in the secant ideal, and Proposition 14.41 describes how symbolic powers may be computed 
via the join operation. 

Proposition 4.3. [T5] If I C (x\, . . . , x n ) d then 

/f>C(x 1) ...,x n )^- 1 ) +1 . 

Proposition 4.4. [19] Suppose that K is algebraically closed. If I is a radical ideal, then 
I<r)=I*{x 1 ...,x n ) r . 

These results lead us to a more general result involving containments of secant ideals in 
symbolic powers. Theorem 14.21 reflects information about one graded piece of this containment. 

Proposition 4.5. Suppose that K is algebraically closed, I is radical, and I C (x\, . . . , x n ) d . 
Then 

jM c j((r-l)(d-l)+l) < 
Proof. This follows by the chain of containments 

jW = / * C / * ( Xl , . . .,X n f-^ d -^ +1 = 

The first equality is by the associativity of the join, the second containment follows because joins 
respect containment together with Proposition 14.31 and the third equality follows by Proposition 
PI □ 

Proof of Theorem \4-%\ ' This is a direct consequence of Proposition 14.51 and Corollary 12.101 □ 
Proof of Theorem \1.2c This is a direct consequence of Theorems 14.11 and 14.21 □ 

Remark. The proofs of Theorems 14.11 and 14.21 can be extended to the nonreduced case. To do 
this requires the replacement of the symbolic power with the differential power, and some more 
algebraic reasoning in the proof of Theorem 14. 11 We have only included the proof of the reduced 
case because it is, by far, the most interesting. 

5. Application to the Binary Symmetric Model 

As mentioned in the introduction, one recent motivation for the detailed study of equations 
vanishing on secant varieties comes from algebraic statistics, where secant varieties correspond 
to statistical models called mixture models. Our goal in this section is to illustrate how pro- 
longations can be used to derive some nontrivial algebraic constraints on mixture models in 
situations where it seems difficult to prove directly that the same equations belong to the secant 
ideal. In particular, we explore this problem for some models arising in phylogenetics. 
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To give our description of equations in the prolongation, we first need to describe the space 
of quadrics which generate the ideal of the phylogenetic model. The bulk of this description 
can be found in [TTj. First we describe the variables of the toric ideal. Let T be an unrooted 
trivalent tree (that is, each vertex of the tree that is not a leaf has degree three) with n leaves. 
The ideal of the phylogenetic model It lives in the polynomial ring in 2 n_1 indeterminates: 

K[q] := K[ Qi | i G (Z/2Z)£ ven ] 

where (Z/2Z)g Ven is the group of binary strings of length n with sum zero in Z/2Z. These q 
indeterminates are the Fourier transform of the natural probability coordinates (see |17j). 

The ideal It is generated by determinantal quadrics. Specifically, each internal edge e of 
the tree induces a split of the leaves into two disjoint sets, A\B. The indeterminates are also 
partitioned into two disjoint sets, namely, the sets 

{ qi | ^ ij = G Z/2Z} and { qi | J^ij = 1 € Z/2Z}. 

jeA jeA 

These two sets of indeterminates fit into two 2I" 4 ! -1 x 2^ B \~ 1 matrices, Mq and Mf , whose rows are 
indexed by the strings \a and whose columns are indexed by \b- The toric ideal It is generated 
by the set of all 2 x 2 minors of the matrices Mq and Mf as e ranges over all the internal edges 
of T. Let At denote the space of quadrics generated by the determinants described above. 

Example 5.1. For instance if T is the trivalent tree with four leaves with unique internal split 
12 1 34 then we have 

M e = I 90000 90011 | M e = I 90101 90110 I 

° \9ll00 91111/ 1 V^ 1001 91010/ 

and It is a complete intersection of quadrics. □ 

While the description given thus far is rich enough to describe generators of the ideals It, 
we need a more involved combinatorial description of the indeterminates in the ring M,[q] to 
provide a characterization of the polynomials in the prolongation. The crucial observation is 
that associated to each indeterminate (ft is a labeling of all edges in the tree T, by zeros and 
ones. An edge gets the label YljeA(e) h ^ Z/2Z where A(e) is one part of the split induced by 
the edge e. Note that this labeling naturally corresponds to a set of disjoint paths through the 
tree T such that the end points of every path are leaves of the tree. Conversely, every such set of 
disjoint paths is the associated labeling of some indeterminate q\. Thus, for each such labeling 
L we get an indeterminate qi. In [5], these unions of paths are called sockets. We will use these 
path indeterminates in the remainder of the section. 

Now we wish to describe generators of the prolongation of the space of quadrics we have 
described, which we do in terms of the path indeterminates from above. A frame F is a partial 
labeling of the tree T where the labels have been assigned to a trivalent subtree T(F) of T. 
The frame has active edges a{F) which are the leaves of T(F) that are not leaves of T. Each 
active edge e induces a subtree T e (F) C T consisting of all edges on the side of e that does not 
contain T(F). Let L e denote the set of all possible labelings of T e {F) that are compatible with 
the frame F (that is, can be completed to a variable). If F is a frame and e is an edge of F that 
has been assigned, let F(e) be the label assigned to the edge e. 
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Definition 5.2. A collection of frames F\, . . . , F^ together with a function e(-, •) : (t? ) — > E(T) 
is compatible if e(-,-) satisfies: 

(1) For all (i,j) G (If), e(i, j) G o(*i) n a(i^) with Fi(e(i,j)) = Fj(e(i,j)); 

(2) Jle(i,j) = e(j,k) then e(i,j) = e(i,k); and 

(3) For all j G [<f], U ¥j e(i, j) = a(iT,-). 

The function e(-, •) determines an equivalence relation on the set of pairs (Fi, e) with e G a(Fi) 
where (Fi, e) is defined to be equivalent to itself, and if i ^ j, then we define (Fi, e%) ~ (Fj, e-i) 
if e(i,j) = ei = e%. Let E denote such an equivalence class and let C(E) C L e be a set of 
distinct labelings of T e (F) compatible with the Fi G E. Given all these data (in particular, 
the frames and the labeling sets C(E)) we define a polynomial of degree d. To do this, fix a 
particular base ordering on each of the sets C(E). Now for each E, take some permutation of 
C(E). This set of permutations can be used to complete all the frames F\, . . . ,F^ in a unique 
ordered way. This is accomplished by adding the first element of each C(E) to the frame Fj 
that appears first in the equivalence class E and so on. Thus each set of permutations yields 
a monomial in the q^. Denote by P(F\, . . . , F&; C(E\), . . . , C(Ef.)) the signed sum of all such 
monomials where the sign of a monomial is the product of the signs of the each permutations 
used to form the monomial. 



Example 5.3 (The six- leaf snowflake). Let T be the six-leaf tree depicted below and let A be 
the span of the quadratic binomials which generate It- 



2 1 




4 5 



Computing and A^ with Macaulay 2 [TT], we see that A^ is spanned by 32 8-term 
cubics and A^ is spanned by a single 64-term quartic form. As we will see, in both cases, the 
construction described above yields the entire prolongation. 

To construct a cubic, we need to choose three frames with compatible labelings. In the 
diagram below our choice of frame is depicted in bold. 
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We show triples of trees that correspond to the first three terms of the 8-term cubic 

<7oiiooo<Zioooio<Zmm — <Zioiooo<Zoiooio<7mm 

— ^OllOll^lOOOlO^llllOO + 9l0101l9010010 ( /llll00 

— <Zoiiooo9iomo9iiooii + 9ioiooo<7oimo<?iiooii 
+goiioii<Zromo<Ziioooo — 9ioioii9oiiiio9iioooo 

We get 32 such cubics because there are 4 ways of choosing 3 distinct inner edges and two 
ways of labeling each of the pairs of cherries attached to the three inactive edges on the frames. 

Our 64-term quartic is constructed by choosing the 4-tuple consisting of our 4 distinct choices 
of frames and completing each edge-labelling in any way allowed. There is only one way to 
define the function e(-, •) and each edge on each frame is active. 

Example 5.4 (The six-leaf caterpillar). Let T be the six-leaf caterpillar-shaped graph below. 




The corresponding toric variety has ideal It generated by the 2x2 minors of four 2x8 and 
two 4x4 matrices. Using Macaulay 2 [llj . one can see that A^ is spanned by 32 6-term cubics 
and that is spanned by two 24-term quartics. These forms may be constructed using the 
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methods above although it is easy to see that they are the 3x3 and 4x4 minors of the 4x4 
matrices used to define It- 

Theorem 5.5. For any set of compatible frames T = F\, . . . , F^, compatibility function e(-, •), 
and completions C = C(E\), . . . , C(E) C ), the polynomial P(J-;C) is in the prolongation 2 \ 

Proof. The proof is by induction on d. When d = 2, P(T;C) yields the description of a 2 x 2 
minor of a matrix ^phf^)) associated to the unique common active edge in the two frames. So 
suppose the statement is true for d — 1. The result will follow if we show that the derivative of 
the degree d polynomial P{J-; C) with respect to any variable is the sum of polynomials of the 
form P{J-'\ C) of degree d — 1. 

Let T = Fi,...,Fd and qi be any variable appearing in a monomial in P(JF\C). By our 
construction of P(.F;C), each occurrence of qi is associated to a frame in T. Without loss 
of generality, assume that qi arises by completing the labeling of the frame Fj. (It may be 
associated to other frames as well.) In each monomial in which qi appears by completing a 
labeling of F^, the d — 1 other factors come from the frames F±, . . . F^— i- Now construct the 
new set C(Ei)', . . . , C{EjS)' by removing the elements from C{E{), . . . , C(E/ : ) that are used to 
make qi- If any of the sets C(Ei)' are singletons, we can take this single element and modify the 
appropriate frame Fi to get a new frame, and remove the set C(Ei)' from our list of completions. 
Carrying out this procedure yields a set of frames T' = F[, . . . , F[ 1 _ 1 and a set of completions 
C = C(E[), . . . C(E' r ), such that upon dividing all monomials in C) that have this particular 
realization of qi (associated to the frame F^) we get the polynomial P(J 7/ ;C / ). Applying this 
argument to all realizations of qi by different frames, we deduce that the derivative of P(^ r ;C) 
with respect to qi is the (signed) sum of polynomials Pi^F'^C). □ 

Remark. Note that the argument in the preceding proof holds even when qi appears to a power 
> 1, because the coefficient of the derivative will account for the different frames that yield qi- 
It is interesting to note that this exceptional case cannot occur, however, because A is generated 
by polynomials with all squarefree terms. Thus, A^ d ~ 2 ^ is also generated by polynomials with 
all squarefree terms. 
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